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We study the dynamics of the "Kolkata Paise Restaurant problem" . The problem is the following: 
", In each period, N agents have to choose between N restaurants. Agents have a common ranking of 

the restaurants. Restaurants can only serve one customer. When more than one customer arrives 
' at the same restaurant, one customer is chosen at random and is served; the others do not get the 

, service. We first introduce the one-shot versions of the Kolkata Paise Restaurant problem which we 

, ^ call one-shot KPR games. We then study the dynamics of the Kolkata Paise Restaurant problem 

' (which is a repeated game version of any given one shot KPR game) for large N. For statistical 

analysis, we explore the long time steady state behavior. In many such models with myopic agents 
we get under-utilization of resources, that is, we get a lower aggregate payoff compared to the social 
j- — ' optimum. We study a number of myopic strategies, focusing on the average occupation fraction of 

I ' restaurants. 

I. INTRODUCTION 

° : n 

, Following the earlier version of the Kolkata Restaurant Problem [1[ as a generalization of the El Farol Bar Problem 
r/3 ' Q and the consequent Minority Game Problem ||, we introduce a very simple version of the game: the Kolkata Paise 
Restaurant (KPR) problem, which is a repeated game with no mutual interaction. The set N of agents (restaurants) 

■ can be macroscopically large and agents have a common preference ranking across the N restaurants. We first 
introduce one-shot KPR games. Any given one-shot KPR game is repeated each period in the KPR problem. For 
any one-shot KPR game the set of pure strategy Nash equilibria coincides with the set of socially (Pareto) efficient 
outcomes. By socially efficient outcomes we mean outcomes where each agent goes to a separate restaurant and 
each restaurant gets exactly one agent to serve. If agents are rational and N is small, one can sustain any socially 
efficient outcome in equilibrium of the repeated game version of the KPR problem (see Q). However, we assume 
that N is large and hence an agent cannot rely on other agents actions and therefore what matters to any agent 

■ is the past collective configuration of actions and the resulting average utilization of restaurants. Assuming very 
'simple' stochastic strategies based on history and past individual experiences and assuming no mutual interaction, 

' we observe that the KPR problems are 'intrinsically dynamic' in the sense that, the system in general never converges 
to any of the states and most of the time, the system passes through socially inefficient states (for a special example 
of converging dynamics, see Sec. Illlf) . Our main objective is to analyze the statistics of the fraction of restaurants 

' utilized per day (utilization fraction /) associated with various stochastic strategies. In particular, we investigate 
some of the optimally utilized states of the system, the average occupation (utilization) fraction of the restaurants on 
^ ' any day and their fluctuations. 

•1— I . 

S_l 1 II. THE PROBLEM 

Let there be N restaurants in Kolkata and assume that each can accommodate a fixed number of customers, normalized 
to unity. Let there be exactly N customers or agents in the city. Though each of the restaurants costs the same, we 
assume that there is a common preference ranking of these restaurants to the agents. If, in a day, there is more than 
one agent (prospective customer) arriving in any particular restaurant, one of them is chosen randomly and the rest 
do not get the service. In Kolkata, there happened to exist very cheap and fixed rate "Paise Restaurants" (also called 
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"Paise Hotels"), popular among the daily laborers in the city. During lunch hours, the laborers used to walk down (to 
save the transport costs) to any of these restaurants and would miss the lunch if they arrived at a restaurant where 
their number is more than the capacity of the restaurant for such cheap lunch. Walking down to the next restaurant 
would mean failing to report back to the job in time! Paise means the smallest Indian coin and there were indeed some 
well known rankings of these restaurants as some of them would offer more tastier items compared to the others. 

Herd behavior and herd externality (see 0,@|) cannot explain the omnipresence of social inefficiency for the KPR 
problem since we assume that there is no ambiguity among agents over the ranking of the restaurants. The KPR 
problem is similar to the Minority game since it punishes herd behavior and promotes diversity (see Q). The KPR 
problem is different from the Minority game in the following ways: (a) while the minority game is a simultaneous move 
two choice problem, the KPR problem is a simultaneous move many choice problem and (b) for the KPR problem 
there is a common preference ranking of these restaurants and this sort of ranking is absent for the minority game. 

We assume that while deciding on the choice of the restaurant, there is no mutual interaction among the agents, and 
that they decide simultaneously on the basis of own past "experiences" and complete history. There are N N possible 
outcomes of which most are socially inefficient in the sense that there exists at least one agent not getting lunch and 
equivalently, there is at least one restaurant without a customer. There are exactly N\ (that is, a fraction exp[— N] 
as N — > oo ) outcomes that correspond to socially efficient utilization of the restaurants where each agent ends up in 
a different restaurant. Given that all agents pay the same price (though rankings are different), in general a socially 
efficient outcome is not individually optimal in this repeated set up. Therefore, even when there is a realization of 
the efficient outcome in one period, it gets destabilized in the next period and the whole system breaks down to 
any of the infinitely many (exp[A]; N — > oo) socially inefficient states. Hence there can be no fixed point result for 
the problem and the game continues dynamically forever. The system moves to an inefficient state characterized by 
overcrowding at higher ranked restaurants (absence of any "absorbing" state). It is precisely this generic deviation 
from the efficient outcome that motivated us to investigate some of the optimally utilized states of the system and the 
average occupation (utilization) fraction of the restaurants on any day. Therefore, our attempt is to analyze different 
versions and modifications of the KPR problem using various deterministic and stochastic strategies. 

III. ONE-SHOT KPR GAME 

How does one capture the features of the Kolkata Paise Restaurant problem of any given day? That is, the situation 
where on any given day 

(a) each agent goes for lunch to any one of the N restaurants, 

(b) depending on the number of agents in each restaurant only one agent is selected randomly and allowed to have 
lunch, and 

(c) the agent derives a utility from having lunch and this utility level depends on the rank of the restaurant and if 
the agent does not get lunch then his utility is zero on that particular day. 

One can use the tools available in the game theory literature to model the Kolkata Paise Restaurant problem of any 
given day as a one-shot game. We now introduce a general one-shot common preference restaurant game and identify 
the conditions under which it represents the one-shot KPR game. Let N = {I, . . . , N} be the set of agents (N < oo) 
and let the vector u — (u\, U2, ■ ■ ■ , un) & $l N represent the utility (in terms of money) associated with each restaurant 
which is common to all agents. Assume without loss of generality that < < . . . < U2 < u%. These three things 
(that is, J\f, u — [u\, . . . ,um) and the ranking < un < ... < u-i < u\) are used to define the one-shot common 
preference restaurant game. Formally, G(u) = (A/", S, II) represent a one-shot common preference restaurant game 
where Af = {I, . . . , N} is the set of agents (players), S = {1, . . . , N} is the (common) strategy space of all agents where 
a typical strategy Si = k denotes the strategy that the i-th agent's strategy is to go to the k-th restaurant. In other 
words, each agent on any given day selects any one of the N restaurants numbered 1 to N and if any agent i decides 
to go to restaurant k £ {I, . . . , N} then we represent it as Si — k. We denote by n = (Hi, . . . ,IIw) the expected 
payoff vector. In particular, H(s) = (Hi(s), . . . ,Hn(s)) is the expected payoff vector associated with any strategy 
combination s — (si, . . . , sn) 6 S N where player i's payoff is Hi(s) = n ^ and Ui(s) = 1 + \{j G M \ {1} \ Si = Sj}\. 

Therefore, the strategy combination s = (si, . . . , sjv) represents the selection of restaurants made by the agents and 
the expected payoff that an agent i £ TV gets depends not only on which restaurant he has selected but also on 
how many other agents have selected the same restaurant. The number of agents selecting the same restaurant as 
that of agent i under the strategy combination is given by n^s). Since we have assumed that the restaurant selects 
one agent randomly from the set of Ui(s) agents it follows that the expected payoff of agent i given the strategy 
combination s = (s%, . . . , sn) is Hi(s) = n 'A \ . For each selection of a vector u = (iti, 1*2, . . . , ujv) with the property 
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that < un < • ■ ■ < 1*2 < ui we associate a common preference restaurant game G(u) = (Af, S, II). We now introduce 
added restrictions on the common preference restaurant games to represent the one-shot KPR games. 

The first feature of the KPR problem is that agents have a common preference ranking over the restaurants. If all 
restaurants are identical to the agents (that is, if Ui = . . . , ujv) then the problem is trivial. Hence to introduce the 
problem in a non-trivial way it is reasonable to assume that for the one-shot KPR game u\ =/= un- The next feature 
of the KPR problem is that agents prefer getting lunch to not getting lunch. To capture this feature we assume that 
the payoff specification of a one-shot KPR game must be such that Uk> > ^ for all k, k' G {1, . . . , N} and given 
< un < ■ ■ ■ < U2 < u\ and u\ ^ un this is equivalent to assuming that un > Intuitively if un > -y- then 
even the person who was at the lowest ranked restaurant, would not be willing to alter his strategy. Therefore, if a 
one-shot common preference restaurant game G(u) — (TV, 5,11) is such that < un < • • ■ < u 2 < U\, u\ ^ un and 
un > ¥f then it is a one-shot KPR game. 

We say that a strategy combination s — (si, . . . , sn) leads to socially efficient (or Pareto efficient) outcome if 
and only if each agent goes to different restaurants that is Si ^ Sj for all i,j £ Af, i 7^ j. A strategy combination 
s = (si, . . . , sn) is a pure strategy Nash equilibrium if no agent has an incentive to deviate from his existing strategy 
Si given the strategy of the other players. For the above mentioned version of the one-shot KPR game the set of 
pure strategy Nash equilibria coincides with the set of socially efficient allocations. This is formally established in the 
appendix. 

In this section we have introduced the one-shot common preference restaurant games and identified conditions under 
which these games also represent the one-shot KPR games. The key features that separate out a one-shot KPR game 
from other non-KPR one-shot common preference restaurant games are non-triviality (u± 7^ ujv) and the restriction 
un > -y- that ensures that it is always in the interest of any agent to choose an unoccupied restaurant. 



It is well known from the repeated game theory literature that if agents can coordinate their actions then any pure 
strategy Nash equilibrium outcome of the one-shot KPR game (which is also a socially efficient outcome) can be 
sustained as an equilibrium outcome of KPR problem (the repeated one-shot KPR game) with appropriate punishment 
schemes (see Q). However, sustaining such equilibrium in repeated play of the one-shot KPR game would require 
that set of agents is small so that they can coordinate their actions. Since expecting such coordinated actions from 
the agents is unrealistic for the KPR problem (where N is large) we assume that with repetition of a one-shot KPR 
game agents work out their strategies without coordination. 

Following Brian Arthur's tradition we assume that agents reason inductively (see 0). By inductive reasoning we 
mean that agents learn from past experiences, discard their old strategy for something new if it performs badly and 
continues with the old strategy if it performs well. However, our objective in this paper is not to find out strategies 
that survive this process of inductive reasoning but to analyze the statistics of the fraction of restaurants utilized 
per day associated with each such strategy. We assume that all agents take up some (common) stochastic strategy 
independent of each other. By stochastic strategies we mean that agents choose their strategies in a probabilistic way. 
At any step this choice of probabilities can depend on the outcome of the previous period. In each case we consider 
the limit N — > 00 and analyze the statistics of the fraction / of the restaurants utilized per period. The specific 
one-shot KPR game G(u) that we use is the one where < un < ■ ■ ■ < U\ so that agents have a strict ranking and 



On each day t an agent selects any one restaurant with equal probability. There is no memory or learning and each 
day the same process is repeated. 

If any restaurant is chosen by m(> 1) agents on a particular day t, the restaurant chooses one of them randomly, 
and the rest m — 1 agents go without any lunch that day. Let there be XN agents (A = 1 in KPR problem) choosing 
randomly from the set of N restaurants. Then the probability p of choosing any restaurant is l/N. The probability 
P(m) that any restaurant is chosen simultaneously by m agents is therefore given by Poisson distribution 



IV. SIMPLE STOCHASTIC STRATEGIES AND UTILIZATION STATISTICS 




A. No learning (NL) strategy 
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FIG. 1: Numerical simulation results for the normalised (by the respective half widths) distribution Do(f) of the fraction / 
of people getting lunch on any day (or fraction of restaurants occupied) against A/ = / — / where / denote the average 
occupancies in each cases. The inset shows the bare distributions D(f) against A/. All the numerical simulations have been 
done for N = 1000 agents for 10 6 time steps. / ~ 0.63 in case NL, f ~ 0.43 in case LL(1) and / ~ 0.77 in case OPR, as 
estimated analytically (Eqns. ([2]), (|3} and ((U). The data points are (+), (□) and (O) f° r cases NL, LL(1) and OPR respectively. 

In other words, the fraction of restaurants not chosen any day is given by P(m = 0) = cxp(— A), giving the average 
fraction of restaurants occupied 

7=1- exp(-A) ~ 0.63 for A = 1, (2) 

in the KPR problem. The distribution D(f) of the fraction utilised any day will be a Gaussian around the average 
given above. See the simulation results in Figure [T]for N — 1000. 

B. Limited learning (LL) strategy 

We consider two types of learning strategies and show that in both the cases the utilization fraction / goes down 
compared to the no learning case. 

LL(1) If an agent gets lunch on day t then the agent goes to the best restaurant on day t + 1. Moreover, if on some 
day t the agent goes to best restaurant and fails to get lunch then, on day t + 1, the agent selects a restaurant 
from the remaining N — 1 restaurants with equal probability. 

When f t N number of agents get their lunch on any day t and decide to get their next day ((i + l) th day) lunch in 
the best ranked restaurant, then only one of them will get into the highest ranked restaurant and others will not get 
their lunch. The rest N — f t N agents will try independently for the remaining N — 1 restaurants, following the same 
strategy as in the no learning case. Then the recursion relation will simply be (following Equation ((H)): 

/ t+1 = l-exp(-A t ); Xt = l- ft (3) 

giving the fixed point fraction / ~ 0.43. This again compares well with the numerical simulation results (for N = 1000) 
and shows a Gaussian distribution D(f) around the above value of / as in Figure [TJ 

LL(2) If an agent reaches the k-th ranked restaurant on day t then the agent's strategy is the following: (a) if the agent 
gets lunch on day t then, on day t+1, the agent attempts to select a restaurant from the rank set {k — 1, . . . , 1} 
with equal probability and (b) if the agent does not get lunch on day t then, on day t+1, the agent attempts to 
select a restaurant from the set {fc, . . . , N} with equal probability. 
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For this strategy we get the fixed point fraction / ~ 0.5 and it shows a Gaussian distribution around /. Hence 
in this case also there is no improvement compared to the NL case. If we can somehow introduce repetition in the 
strategy of the agent then we get an improvement in the utilization fraction. In fact, since the reverse choice problem, 
where for (a) the choice would be {k, . . . , N} and for (b) would be {k — 1, . . . , 1}, is identical, we get / = 1 — /, giving 
/ = 0.5. 



C. One period repetition (OPR) strategy 

If an agent gets lunch on day t from restaurant k then the agent goes back to the same restaurant on day (t+ 1) and 
competes for the best restaurant (restaurant 1) on day (t + 2). If the agent fails to get lunch on day t then the agent 
tries for a restaurant on day (t + 1) which was vacant on day t, using the same stochastic strategy as in the NL case. 

The fraction f t on any day t of the people occupying any restaurant consists of two parts: fraction Xt-i of the 
people already continuing in their randomly selected restaurant the earlier day and the fraction x t of people who have 
chosen today. As such, using Equation (|2]|. 

/ t = i t _ 1 + (l-a: t _ 1 )[l-exp(-l)]. (4) 

Since the fraction x t chosen today is given by the fraction / in Equation |(3J| where N(l — x t -\) left out agents are 
choosing randomly out of N(l — Xt-i) vacant restaurants. The next day, the fraction f t +i will therefore be given by 

/t+i - (1 - x t )(l - exp(-l)) + [1 - (1 - x t )(l - cxp(-l))][l - exp(-l)]. (5) 

If we assume that f t +\ = f = ft and x t = x at the fixed point, then equating jlj and (|5|) 

x = [1 - (1 - x)(l - exp(-l))](l - exp(-l)), (6) 

or x ~ 0.38, giving / ~ 0.77, as can be seen in the numerical simulation results in Figure [TJ The fluctuations in 
the occupation density or utilization fraction D(f) is again Gaussian around the most probable occupation fraction 
derived here. Therefore, with OPR strategy we have an increase in / compared to the NL strategy suggesting that 
if the strategies are such that an agent, after getting lunch on a particular day, decides to go to the same restaurant 
next day we have an improvement in utilization fraction. 

Remark The OPR strategy is more consistent when the one-shot common preference restaurant game G(u) is such 
that u = (ui = . . . = un). It may be noted that if all the restaurants are identical then the dynamics always converge 
to an absorbing state characterized by social efficiency (with Pavlov's win-stay, lose-shift (WS-LS) strategy (see 
Q)). The reason being the following: even if everybody starts by random choices among all the TV restaurants and, 
initially (say, on day t), several restaurants get more than one prospective customer and choose one of them, on day 
t + 1, the agents not getting lunch on day t will go to a restaurant that was unoccupied on day t and the agents 
getting lunch on day t will continue to go to the same restaurant. Soon (before, N + 1 days), each agent finds one 
restaurant to get the lunch and the dynamics effectively stops there. 

In the next case we think of a strategy where the queue length in a restaurant on day t either acts as an inducing 
device or acts as a rationing device for the strategy to be adopted by an agent on day t+1. 

D. Follow the crowd (FC) strategy 

The probability of selecting a restaurant k on day t + 1 depends on the queue length of agents in restaurant k on day 
t. 

We assume that the probability pk(t + 1) of choosing k th restaurant on (t + l) th day is given by 

«« + » = if^ < 7 » 

where quit) is the queue length (or equivalently, the number of people who arrived at the fc th restaurant on day t). 
r is a parameter that interpolates between the random (r — > oo, independent of history) and condensation (r — > 0, 
strongly dependent only on history) limits. The representative histograms are shown in the Figure 2, where the most 
probable fraction / of occupied restaurants converge to a value 0.63 (as obtained earlier) as r — ► oo. It is also seen that 
the probability of a queue length q of any restaurant goes as g" 1 for r — > (exponentially decaying otherwise). This 
implies that while the utilization fraction has a Gaussian distribution (with / = 0.63) the queue length distribution 
on any arbitrary day t has a power law distribution if r — ► and it is exponentially decaying otherwise. 
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FIG. 2: Numerical simulation results for the distribution D(f) of the fraction / of people getting lunch any day (or fraction of 
restaurants occupied on any day) against /, for r — 0.1, 1.0 and 9.0 for case FC. All the numerical simulations have been done 
for N = 512 agents for 10 6 time steps. 



If the agents follow avoid the crowd (AC) strategy that is if Pk(t) — > 1 — Pk(t) (people choosing to avoid the last 
day's crowded restaurants), the most probable restaurant utilization fraction / ~ 0.63 is independent of the value of 
r (> 0). Here, no power law for the queue length was observed for any value of r. The observations of this subsection 
are similar to the Hospital waiting list size distribution (see Q, @) though the Hospital problem is significantly 
different from the KPR problem (for example, the absence of possibilities of iterative learning and the presence of 
heterogeneity of choices in the Hospital waiting list problem). 



V. LIMITED QUEUE LENGTH AND MODIFIED KPR PROBLEM 

Till now we had assumed that on any day t one can have potentially all the N agents turning up in one particular 
restaurant. If the arrival of the agents were sequential then all agents turning up in one restaurant tantamount to 
saying that queues of all sizes are allowed to form in any restaurant and the restaurant then picks up any one agent 
to serve. All other agents have to go back without having lunch on day t. Here we are assuming that formation of 
queue of all sizes is allowed. What if all restaurants have limited capacity to accommodate agents in a queue and that 
an agent gets the option to move to another restaurant (in the same day) if the agent arrives at a restaurant that has 
reached its queue limit? Before concluding we consider this particular modification of the KPR problem since it has 
important implications about the robustness of our occupancy statistics /. 

A modified KPR problem: On each day the restaurants are filled up sequentially. We also assume that each restaurant 
allows for a queue length q* , so that the (q* + l) th person arriving at the fc th restaurant has to search for another 
restaurant with a queue length less than q* . When all the customers have arrived, one of the qi < q* customers gets 
lunch at the k th restaurant, while the cji — 1 number of prospective customers do not get any lunch that day. In terms 
of the dynamics, the most probable restaurant utilization fraction / = 1 occurs for q* = 1. We find that for q* = 2, 
the fraction / drastically falls to 0.68. Also for q* N, f — 0.63 as discussed and derived earlier. Figure 3 shows 
the result for different values of q* . Note that / approaches the value for the random occupancy case as q* becomes 
large. Thus, except for the special case where q* = 1, for all other cases we get conclusions that are closer to the KPR 
problem derived in the previous section. 
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FIG. 3: Numerical simulation results for the distribution D(f) of the fraction / of people getting lunch any day (or fraction 
of restaurants occupied on any day) against /, for q* — 2, 3, 4, 5 for the modified KPR problem. All the numerical simulations 
have been done for N = 1024 agents for 10 6 time steps. 



VI. DISCUSSION AND CONCLUSION 



For the general KPR problem we have a perpetual dynamics characterized by large fluctuations in the measure of 
daily misuse having a Gaussian distribution of utilization fraction / around the average / where / differs for different 
strategies. We have argued this point by showing that the average utilization fraction / with no learning and other 
simple stochastic strategies could be analytically calculated and the distributions are seen to be Gaussian. For the 
"follow/avoid the crowd" strategies we observe the statistics of the number of people going without lunch to have 
power law fluctuations (see also [3] , , , @ for similar behavior) . 

Our main findings in terms of utilization fraction comparisons across different strategies are: 

(a) If we have a one-shot KPR game then any pure strategy Nash equilibrium leads to a socially efficient outcome, 
as shown in Sec. lIIIl and hence we have full capacity utilization. However, if we have a KPR problem (which is a 
repeated game where any given one-shot KPR game is repeated) and agents cannot coordinate their action then 
we do not have full capacity utilization for many (common) stochastic strategies. In such a situation utilization 
fraction under different stochastic strategies is of prime importance. 

(b) The utilization fraction is higher on an average with no learning strategy in comparison to some limited learning 
strategies (Sec. IIV|) . 

(c) The statistics for utilization fraction can improve with a strategy where each agent, after getting lunch from a 
restaurant on day t, goes back to the same restaurant on day (t + 1) (ignoring the ranking of the restaurants) 
(Sec. lTVCj) . 

(d) Following (avoiding) the crowd strategy cannot improve on the statistics of utilization fraction in comparison 
to the no learning case (Sec. IIV D|) . 

(e) Our findings about the statistics of utilization fraction are robust since we have shown that by adding different 
queue limits to the KRP problem the conclusions are similar to our findings except for the very special case 
where the queue limit is unity (Sec. fV|) . 
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VII. APPENDIX 



A strategy combination s G S N for the one-shot common preference restaurant game G(u) = (Af, S, II) leads to 
socially (Pareto) efficient outcome if and only if rii(s) = 1 for all i G N. In other words, given any one-shot common 
preference restaurant game G(u) a strategy combination leads to a socially efficient outcome if and only if each agent 

goes to a different restaurant so that the sum of utilities of the agents is Ylk=i Uk - Let s ~ i denote the strategy 
combination of all but player i. A strategy combination s* G S is a pure strategy Nash equilibrium of the one-shot 
common preference restaurant game G(u) = (A/", S, II) if for all i G Af, IL,(s*) > Hi (sj, s*_f) for all S{ G S. For a typical 
one-shot common preference restaurant game G(u) — (Af,S,H), let SE(G(u)) — {s* G | n,-(s*) = IVi G Af} 
denote the set of all strategy combinations that lead to socially efficient allocations and let PN(G(u)) = {s* G \ 
Vi G Af,Ui(s*) > II,j(sj, slj V Si G S} denote the set of all pure strategy Nash equilibria. Let G(u) represent any 
one-shot KPR game we introduced in Sec. IIII1 that is any one-shot common preference restaurant game with the 
property that < un < . . . < u± < 2un, u\ ^ un- We have the following result. 
Proposition: For any one-shot KPR game G(u), SE(G(u)) = PN(G(u)). 

Proof: Consider any one-shot KPR game and a strategy combination s* G SE(G(u)). Take any agent i G Af and 
any deviation Si — k ^ s* — k' . Since the number of agents is equal to the number of restaurants and since for a 
strategy combination s* G SE(G(u)), rii(s*) = 1 for all i G Af, it follows that rii(si, s*_f) = 2 > rii(s*) = 1. Therefore, 
n^SijSli) = -rr < = IL(s*) since for the one-shot KPR game < un < • • ■ < «i < 2un ^> < u^ for all 
k, k' E {1, . . . , N} and hence agent i has no incentive to deviate. Since the selection of i was arbitrary it follows that 
s* G PN(G(u)) implying SE(G(u)) C PN(G(u)). 

To prove PN(G(u)) C SE(G(u)) consider any strategy combination s_j for all but player i. Given s-i, the 
best response of agent i is to select a best ranked restaurant from the set UAfi(s-i) = {k G Af | Sj ^ k V j ^ i} of 
unoccupied restaurants. The reasons being (1) given ^ < ujy, ^ < Uk> for all k, k' G {1, . . . , N} and all integers q > 2 
implying that given the strategy of all other agents, agent i strictly prefers to occupy a best unoccupied restaurant 
as opposed to either occupying a different unoccupied restaurant (which is not a best restaurant) or crowding in an 
occupied restaurant and (2) given the number of agents is equal to the number of restaurants, UAfi(s-i) ^ for any 
S-i G S*^ -1 implying that given any strategy of all other agents, agent i can always find an unoccupied restaurant 
and hence agent i can also find a best unoccupied restaurant since < un < ... < Ui < 2ujy. Therefore, it is in the 
interest of each agent to occupy an unoccupied restaurant and given the fact that no matter what others do an agent 
can always find an unoccupied restaurant it follows that any pure strategy Nash equilibrium for the one-shot KPR 
game necessarily leads to a socially efficient allocation, that is PN(G(u)) G SE(G(u)). 
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